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Abstract 

We present a simple randomized incremental algorithm for building compressed 
quadtrees. The resulting algorithm seems to be simpler than previously known algo- 
rithms for this task. 



1 Introduction 

In this note, we point out that compressed quadtrees can be built via randomized incremental 
construction. Compressed quadtrees are simple geometric data-structure. Despite their 
simplicity, they are surprisingly useful for carrying out various geometric tasks, see [HarOSj. 

The first randomized algorithm for building compressed quadtrees is due to Clarkson 
Cla83| . Eppstein et al. |EGS05| suggested building compressed quadtrees by using hierar- 



chical random sampling in a style similar to skip-lists. If one allows bitwise operations (in 
particular, interleaving the bits of two integers in constant time) one can build compressed 
quadtrees using z-order |Gar82l IRar08| by a relatively simple algorithm, but the task is more 



challenging if such operations are not allowed. 

The new algorithm we describe seems to be quite simple, and can be interpreted as a 
variant of the skip quadtree of Eppstein et al. |EGS05| . 



2 Preliminaries 

Definition 2.1 (Grid.) For a real positive number z and a point p = (x,y) in R 2 , define 
G z (p) to be the grid point (L^/^J z, [y/z\ z). Observe that G z partitions the plane into 
square regions, which we call grid cells. Formally, for any i, j G Z, the intersection of the 
half-planes x > zi, x < z{i + 1), y > zj and y < z(j + 1) is said to be a grid cell. 
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Definition 2.2 (Canonical square.) A square is a canonical square, if it is contained 
inside the unit square, it is a cell in a grid G r , and r is a power of two. 

Given a set P of n points in the unit square, a quadtree 7 is built as follows: The root 
corresponds to the unit square. Every node u6l corresponds to a cell □„ (i.e., a square), 
and it has four children. The four children correspond to the four squares formed by splitting 
Qu into four equal size squares, by horizontal and vertical cuts. The construction is recursive, 
and we start from v = root-j- As long as the current node contains more than, say, two points 
of P, we create its children, and continue recursively the construction in each child. We stop 
when each leaf of this tree contains a single point of P. 

By compressing paths (in this tree) of nodes 
that all have a single child, we get a compressed 
quadtree of size 0{n). Let QT{P) denote the 
(uniquely defined) compressed quadtree of 
P. 

A leaf in this quadtree corresponds to a 
canonical square, and a compressed edge (or 
more precisely the top vertex of this edge) cor- 
responds to an annulus formed by the set dif- 
ference of two canonical squares. We will refer 
to such region as a tile, see Figured} that is, a Figure 1: A compressed edge corresponds 
tile is either a square (corresponding to a leaf of to a tile that is the set difference of two 
the compressed quadtree) or an annulus (cor- canonical squares, 
responding to a compressed edge). As such, a 

compressed quadtree induces a partition of the unit square into these tiles. We denote the 
planar map induced by these tiles of the compressed quadtree of P by QT{P). 




3 Algorithm and analysis 
3.1 The algorithm 

Pick a random permutation (P) = (pi, . . . , p n ) of the points of P. Let be the compressed 
quadtree of Pj = {pi, . . . , p*}. In any node of T; that corresponds to a tile / of QT(Pj), we 
store a list, denoted by cl(/), of all the points of P that lie inside /. As such, any point of P 
is stored exactly once somewhere in %. We will refer to cl(/) as the conflict list of /. We 
also store for every point of P a pointer to the node of Tj that contains it. 

In the ith iteration, we find the node V{ of Tj_i = QT(Pj_i) that stores p^, and we insert 
Pi into this node. This insertion might result in at most a constant number (i.e., three) of 
new nodes being created!]] The resulting tree % is the compressed quadtree of Pj. Now, we 

J Here is a sketch why this claim is correct: Only a leaf of a compressed quadtree might contain an inserted 
point. As such, we might need to introduce a new leaf to store the new point Pi. Hanging this new leaf in 
tree might require splitting an existing compressed edge of Tj_i, by introducing a new vertex. Similarly, if 



2 



need to move all the points stored in to their new proper place in Thus, for every 
point stored in v if we check if it has to now be stored in one of the new nodes, and if so we 
move it to this new node. If there are k points in the conflict list of Vi then this iteration 
takes 0(1 + k) time. 

The compressed quadtree T„ is the required tree. 



3.2 The analysis 

Definition 3.1 Let Y be an arbitrary subset of P, and consider a tile / £ QT(Y). A set 
X C Y is a defining set for /, if / £ QT(X) and it is a minimal set with this property 
(i.e., no proper subset of X has / clS Si tile). 

The following is proved by a tedious but easy case analysis. 

Lemma 3.2 If X is a defining set of a tile f £ QT(P) then \X\ < A. 

Unlike "traditional" randomized incremental construction, the defining set is not unique 
in this case. 



Lemma 3.3 Consider a tile f £ QT(Pj). 
iteration is < A/i. Formally, we claim that 



The probability that f was created in the ith 



Pr 



/ £ or (Pi) \ ox{Pi-i) 



f e Q7(P % 



< 



Proof: Let Di, . . . , D m C Pj be all the different defining sets of /. Consider the set 

z = D x n D 2 n • • • n D m . 

Observe that / was created in the ith iteration only if pj £ Z . Indeed, if p ?; ^ Z, then 
there exists a defining set D t of / such that p» ^ D t . But then, / is also a tile of QT(Pj_i) 
as D t C Pj_i, and the probability of this tile to be created in the zth iteration is zero. 

Now, by Lemma 13.21 all the defining sets have cardinality at most four, and \Z\ < 
4. As such, the required probability is bounded by the probability that Pi is in Z. We 
bound this probability by backward analysis. Indeed, fix the set Pj and consider all possible 
permutations of this set. The probability that one of the (at most) four points of Z is the 
last point in this permutation (of i elements) is at most A/i. ■ 

Observe that the probability of a tile / to be created (according to Lemma 13731 ) is inde- 
pendent of the size of its conflict list. 

Lemma 3.4 The expected amount of work in the ith iteration is 0(1 + n/i). 

the leaf / we insert p,: into already stores an inserted point, then we need to introduce a new leaf not only 
for pi but also for the previously stored point in this leaf. This might also result in a new compressed edge 
if two points are close together compared to the diameter of /. 
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Proof: Consider a tile / G QT(Pj). The amount of work spent on it, if it was created in 
the ith iteration, is proportional to the size of its conflict list cl(/). Let Xj be the random 
variable which is the amount of work spend by the algorithm in the ith iteration. Since the 
total size of the conflict lists of % is n, we get by Lemma [3731 that the expected work in the 
ith. iteration is bounded by 



E 



Xi 



° 1+ E 7 W)I =0 1 



/eQT(Pi) 



(Again, the expectation here is over all possible permutations of P$.) Now, we have that 



E[Xi] = E 



E 



0{l + n/i). 



Theorem 3.5 Given a point set P of n points in the plane contained inside the unit square, 
one can build a compressed quadtree for P in 0{n\ogn) expected time. 

Proof: By Lemma [3741 the total expected work of the above algorithm is 0(^" =1 (1 + n/i)) = 
0(n\ogn). m 

Remark 3.6 The algorithm can also be analyzed using the results from Clarkson et al. 
[CMS93| . 



4 Discussion and conclusions 

The algorithm presented for building quadtrees works also for points in higher dimensions. 

It is natural to compare our algorithm to Eppstein et al. |EGS05| . They get a slightly 
more complicated algorithm, but they support both insertions and deletions, while our al- 
gorithm can only build the quadtree. In light of our approach, it is natural to interpret 
the algorithm of Eppstein et al. |EGS05| as a lazy randomized incremental algorithm for 
building quadtrees |dBDS95]. 

The author believes that this is a neat example of backward analysis. The reader naturally 
has the right to disagree. 
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